2 resultados para Automatic Thoughts

em National Center for Biotechnology Information - NCBI


Relevância:

20.00% 20.00%

Publicador:

Resumo:

The Dali Domain Dictionary (http://www.ebi.ac.uk/dali/domain) is a numerical taxonomy of all known structures in the Protein Data Bank (PDB). The taxonomy is derived fully automatically from measurements of structural, functional and sequence similarities. Here, we report the extension of the classification to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families). The computational definitions of attractors and functional families are new. In September 2000, the Dali classification contained 10 531 PDB entries comprising 17 101 chains, which were partitioned into five attractor regions, 1375 fold types, 2582 functional families and 3724 domain sequence families. Sequence families were further associated with 99 582 unique homologous sequences in the HSSP database, which increases the number of effectively known structures several-fold. The resulting database contains the description of protein domain architecture, the definition of structural neighbours around each known structure, the definition of structurally conserved cores and a comprehensive library of explicit multiple alignments of distantly related protein families.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Comparative genomics offers unparalleled opportunities to integrate historically distinct disciplines, to link disparate biological kingdoms, and to bridge basic and applied science. Cross-species, cross-genera, and cross-kingdom comparisons are proving key to understanding how genes are structured, how gene structure relates to gene function, and how changes in DNA have given rise to the biological diversity on the planet. The application of genomics to the study of crop species offers special opportunities for innovative approaches for combining sequence information with the vast reservoirs of historical information associated with crops and their evolution. The grasses provide a particularly well developed system for the development of tools to facilitate comparative genetic interpretation among members of a diverse and evolutionarily successful family. Rice provides advantages for genomic sequencing because of its small genome and its diploid nature, whereas each of the other grasses provides complementary genetic information that will help extract meaning from the sequence data. Because of the importance of the cereals to the human food chain, developments in this area can lead directly to opportunities for improving the health and productivity of our food systems and for promoting the sustainable use of natural resources.